Overview
Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 2169687 |
| Missing cells | 18568423 |
| Missing cells (%) | 29.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.1 GiB |
| Average record size in memory | 1.0 KiB |
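The missing-cell percentage can be cross-checked from the other figures in this table. The short sketch below assumes the percentage is simply missing cells divided by variables times observations; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 29
n_observations = 2_169_687
missing_cells = 18_568_423

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells:,} cells, {missing_pct:.1f}% missing")
```

Under that assumption the result agrees with the 29.5% reported above.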
Variable types
| DateTime | 2 |
|---|---|
| Categorical | 5 |
| Unsupported | 1 |
| Numeric | 9 |
| Text | 12 |
Alerts
| CONTRIBUTING FACTOR VEHICLE 4 is highly overall correlated with CONTRIBUTING FACTOR VEHICLE 5 | High correlation |
|---|---|
| CONTRIBUTING FACTOR VEHICLE 5 is highly overall correlated with CONTRIBUTING FACTOR VEHICLE 4 | High correlation |
| NUMBER OF CYCLIST KILLED is highly overall correlated with NUMBER OF PEDESTRIANS KILLED and 1 other field | High correlation |
| NUMBER OF MOTORIST INJURED is highly overall correlated with NUMBER OF PERSONS INJURED | High correlation |
| NUMBER OF MOTORIST KILLED is highly overall correlated with NUMBER OF PERSONS KILLED | High correlation |
| NUMBER OF PEDESTRIANS KILLED is highly overall correlated with NUMBER OF CYCLIST KILLED and 1 other field | High correlation |
| NUMBER OF PERSONS INJURED is highly overall correlated with NUMBER OF MOTORIST INJURED | High correlation |
| NUMBER OF PERSONS KILLED is highly overall correlated with NUMBER OF CYCLIST KILLED and 2 other fields | High correlation |
| NUMBER OF CYCLIST INJURED is highly imbalanced (92.0%) | Imbalance |
| NUMBER OF CYCLIST KILLED is highly imbalanced (99.9%) | Imbalance |
| CONTRIBUTING FACTOR VEHICLE 4 is highly imbalanced (90.9%) | Imbalance |
| CONTRIBUTING FACTOR VEHICLE 5 is highly imbalanced (90.1%) | Imbalance |
| BOROUGH has 670408 (30.9%) missing values | Missing |
| ZIP CODE has 670677 (30.9%) missing values | Missing |
| LATITUDE has 239855 (11.1%) missing values | Missing |
| LONGITUDE has 239855 (11.1%) missing values | Missing |
| LOCATION has 239855 (11.1%) missing values | Missing |
| ON STREET NAME has 467859 (21.6%) missing values | Missing |
| CROSS STREET NAME has 827830 (38.2%) missing values | Missing |
| OFF STREET NAME has 1794202 (82.7%) missing values | Missing |
| CONTRIBUTING FACTOR VEHICLE 2 has 344595 (15.9%) missing values | Missing |
| CONTRIBUTING FACTOR VEHICLE 3 has 2013111 (92.8%) missing values | Missing |
| CONTRIBUTING FACTOR VEHICLE 4 has 2133997 (98.4%) missing values | Missing |
| CONTRIBUTING FACTOR VEHICLE 5 has 2159928 (99.6%) missing values | Missing |
| VEHICLE TYPE CODE 2 has 428782 (19.8%) missing values | Missing |
| VEHICLE TYPE CODE 3 has 2019068 (93.1%) missing values | Missing |
| VEHICLE TYPE CODE 4 has 2135279 (98.4%) missing values | Missing |
| VEHICLE TYPE CODE 5 has 2160231 (99.6%) missing values | Missing |
| NUMBER OF PERSONS KILLED is highly skewed (γ1 = 32.98672025) | Skewed |
| NUMBER OF PEDESTRIANS KILLED is highly skewed (γ1 = 41.28234763) | Skewed |
| NUMBER OF MOTORIST KILLED is highly skewed (γ1 = 53.53005542) | Skewed |
| COLLISION_ID has unique values | Unique |
| ZIP CODE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| NUMBER OF PERSONS INJURED has 1654274 (76.2%) zeros | Zeros |
| NUMBER OF PERSONS KILLED has 2166423 (99.8%) zeros | Zeros |
| NUMBER OF PEDESTRIANS INJURED has 2047534 (94.4%) zeros | Zeros |
| NUMBER OF PEDESTRIANS KILLED has 2168039 (99.9%) zeros | Zeros |
| NUMBER OF MOTORIST INJURED has 1842231 (84.9%) zeros | Zeros |
| NUMBER OF MOTORIST KILLED has 2168418 (99.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-20 13:29:42.068171 |
|---|---|
| Analysis finished | 2025-04-20 13:31:31.029049 |
| Duration | 1 minute and 48.96 seconds |
| Software version | ydata-profiling v4.16.1 |
| Download configuration | config.json |
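The reported duration follows directly from the two timestamps. As a quick check, the sketch below parses them with the standard library and takes the difference; the format string is an assumption about how the timestamps are written.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table above.
fmt = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2025-04-20 13:29:42.068171", fmt)
finished = datetime.strptime("2025-04-20 13:31:31.029049", fmt)

# Elapsed wall-clock time, split into whole minutes and remaining seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```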
Variables
CRASH DATE
Date
| Distinct | 4672 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
| Minimum | 2012-07-01 00:00:00 |
|---|---|
| Maximum | 2025-04-15 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
CRASH TIME
Date
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
| Minimum | 2025-04-20 00:00:00 |
|---|---|
| Maximum | 2025-04-20 23:59:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
BOROUGH
Categorical
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 670408 |
| Missing (%) | 30.9% |
| Memory size | 133.1 MiB |
| BROOKLYN | 479090 |
|---|---|
| QUEENS | 402093 |
| MANHATTAN | 333177 |
| BRONX | 222053 |
| STATEN ISLAND | 62866 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.4511775 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BROOKLYN |
| 3rd row | BROOKLYN |
| 4th row | BRONX |
| 5th row | BROOKLYN |
Common Values
| Value | Count | Frequency (%) |
| BROOKLYN | 479090 | 22.1% |
| QUEENS | 402093 | 18.5% |
| MANHATTAN | 333177 | 15.4% |
| BRONX | 222053 | 10.2% |
| STATEN ISLAND | 62866 | 2.9% |
| (Missing) | 670408 | 30.9% |
Words
| Value | Count | Frequency (%) |
| brooklyn | 479090 | 30.7% |
| queens | 402093 | 25.7% |
| manhattan | 333177 | 21.3% |
| bronx | 222053 | 14.2% |
| staten | 62866 | 4.0% |
| island | 62866 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1895322 | 17.0% |
| O | 1180233 | 10.6% |
| A | 1125263 | 10.1% |
| E | 867052 | 7.8% |
| T | 792086 | 7.1% |
| R | 701143 | 6.3% |
| B | 701143 | 6.3% |
| L | 541956 | 4.9% |
| S | 527825 | 4.7% |
| Y | 479090 | 4.3% |
| Other values (9) | 2360281 | 21.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11171394 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 1895322 | 17.0% |
| O | 1180233 | 10.6% |
| A | 1125263 | 10.1% |
| E | 867052 | 7.8% |
| T | 792086 | 7.1% |
| R | 701143 | 6.3% |
| B | 701143 | 6.3% |
| L | 541956 | 4.9% |
| S | 527825 | 4.7% |
| Y | 479090 | 4.3% |
| Other values (9) | 2360281 | 21.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 11171394 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 1895322 | 17.0% |
| O | 1180233 | 10.6% |
| A | 1125263 | 10.1% |
| E | 867052 | 7.8% |
| T | 792086 | 7.1% |
| R | 701143 | 6.3% |
| B | 701143 | 6.3% |
| L | 541956 | 4.9% |
| S | 527825 | 4.7% |
| Y | 479090 | 4.3% |
| Other values (9) | 2360281 | 21.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 11171394 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 1895322 | 17.0% |
| O | 1180233 | 10.6% |
| A | 1125263 | 10.1% |
| E | 867052 | 7.8% |
| T | 792086 | 7.1% |
| R | 701143 | 6.3% |
| B | 701143 | 6.3% |
| L | 541956 | 4.9% |
| S | 527825 | 4.7% |
| Y | 479090 | 4.3% |
| Other values (9) | 2360281 | 21.1% |
ZIP CODE
Unsupported
Missing  Rejected  Unsupported 
| Missing | 670677 |
|---|---|
| Missing (%) | 30.9% |
| Memory size | 77.6 MiB |
LATITUDE
Real number (ℝ)
Missing 
| Distinct | 128580 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 239855 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.611136 |
| Minimum | 0 |
|---|---|
| Maximum | 43.344444 |
| Zeros | 5346 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.596296 |
| Q1 | 40.667454 |
| median | 40.720434 |
| Q3 | 40.769615 |
| 95-th percentile | 40.861942 |
| Maximum | 43.344444 |
| Range | 43.344444 |
| Interquartile range (IQR) | 0.102161 |
Descriptive statistics
| Standard deviation | 2.1419157 |
|---|---|
| Coefficient of variation (CV) | 0.052742078 |
| Kurtosis | 354.99951 |
| Mean | 40.611136 |
| Median Absolute Deviation (MAD) | 0.0513358 |
| Skewness | -18.881275 |
| Sum | 78372670 |
| Variance | 4.5878029 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5346 | 0.2% |
| 40.861862 | 918 | < 0.1% |
| 40.696033 | 803 | < 0.1% |
| 40.608757 | 739 | < 0.1% |
| 40.8047 | 702 | < 0.1% |
| 40.798256 | 647 | < 0.1% |
| 40.759308 | 635 | < 0.1% |
| 40.675735 | 593 | < 0.1% |
| 40.6960346 | 587 | < 0.1% |
| 40.658577 | 556 | < 0.1% |
| Other values (128570) | 1918306 | 88.4% |
| (Missing) | 239855 | 11.1% |
| Value | Count | Frequency (%) |
| 0 | 5346 | 0.2% |
| 30.78418 | 1 | < 0.1% |
| 34.783634 | 1 | < 0.1% |
| 40.498947 | 1 | < 0.1% |
| 40.4989488 | 2 | < 0.1% |
| 40.4991346 | 1 | < 0.1% |
| 40.49931 | 1 | < 0.1% |
| 40.4994787 | 1 | < 0.1% |
| 40.499659 | 1 | < 0.1% |
| 40.499672 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 43.344444 | 1 | < 0.1% |
| 42.64154 | 1 | < 0.1% |
| 42.318317 | 1 | < 0.1% |
| 42.107204 | 1 | < 0.1% |
| 41.91661 | 1 | < 0.1% |
| 41.34796 | 1 | < 0.1% |
| 41.258785 | 1 | < 0.1% |
| 41.12615 | 5 | < 0.1% |
| 41.12421 | 1 | < 0.1% |
| 41.061634 | 2 | < 0.1% |
LONGITUDE
Real number (ℝ)
Missing 
| Distinct | 99833 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 239855 |
| Missing (%) | 11.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.721994 |
| Minimum | -201.35999 |
|---|---|
| Maximum | 0 |
| Zeros | 5346 |
| Zeros (%) | 0.2% |
| Negative | 1924486 |
| Negative (%) | 88.7% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | -201.35999 |
|---|---|
| 5-th percentile | -74.038086 |
| Q1 | -73.974648 |
| median | -73.92693 |
| Q3 | -73.86668 |
| 95-th percentile | -73.76282 |
| Maximum | 0 |
| Range | 201.35999 |
| Interquartile range (IQR) | 0.107968 |
Descriptive statistics
| Standard deviation | 4.0013414 |
|---|---|
| Coefficient of variation (CV) | -0.054276088 |
| Kurtosis | 372.95801 |
| Mean | -73.721994 |
| Median Absolute Deviation (MAD) | 0.05265 |
| Skewness | 15.556815 |
| Sum | -1.4227106 × 10^8 |
| Variance | 16.010733 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5346 | 0.2% |
| -73.89063 | 790 | < 0.1% |
| -74.038086 | 740 | < 0.1% |
| -73.91282 | 720 | < 0.1% |
| -73.98453 | 718 | < 0.1% |
| -73.89686 | 685 | < 0.1% |
| -73.91243 | 658 | < 0.1% |
| -73.94476 | 627 | < 0.1% |
| -73.9112 | 598 | < 0.1% |
| -73.9845292 | 587 | < 0.1% |
| Other values (99823) | 1918363 | 88.4% |
| (Missing) | 239855 | 11.1% |
| Value | Count | Frequency (%) |
| -201.35999 | 1 | < 0.1% |
| -201.23706 | 105 | < 0.1% |
| -89.13527 | 1 | < 0.1% |
| -86.76847 | 1 | < 0.1% |
| -79.61955 | 1 | < 0.1% |
| -79.00183 | 1 | < 0.1% |
| -76.2634 | 1 | < 0.1% |
| -76.02163 | 1 | < 0.1% |
| -74.742 | 7 | < 0.1% |
| -74.25496 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 5346 | 0.2% |
| -32.768513 | 16 | < 0.1% |
| -47.209625 | 3 | < 0.1% |
| -73.66301 | 1 | < 0.1% |
| -73.70055 | 2 | < 0.1% |
| -73.700584 | 11 | < 0.1% |
| -73.7005968 | 10 | < 0.1% |
| -73.7006 | 2 | < 0.1% |
| -73.70061 | 5 | < 0.1% |
| -73.70071 | 4 | < 0.1% |
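The extreme-value tables for LATITUDE and LONGITUDE include zeros and values far from the bulk of the data (for example 43.344444 and -201.35999). One way such values might be flagged is a simple range check. The sketch below is illustrative only: the bounds are a rough, assumed bounding box for New York City and do not come from the report, and `in_range` is a hypothetical helper.

```python
# Rough bounding box for New York City. These bounds are an assumption
# made for illustration; they are not taken from the report.
LAT_BOUNDS = (40.4, 41.0)
LON_BOUNDS = (-74.3, -73.6)


def in_range(value, bounds):
    """Return True if value is present and lies within the closed interval."""
    if value is None:
        return False
    low, high = bounds
    return low <= value <= high


# Values copied from the common-value and extreme-value tables above.
latitudes = [40.861862, 0.0, 30.78418, 43.344444]
longitudes = [-73.89063, 0.0, -201.35999, -32.768513]

lat_flags = [in_range(v, LAT_BOUNDS) for v in latitudes]
lon_flags = [in_range(v, LON_BOUNDS) for v in longitudes]

print(lat_flags)
print(lon_flags)
```

With these assumed bounds only the first value in each list passes; the zeros and the distant outliers are flagged for review rather than silently dropped.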
LOCATION
Text
Missing 
| Distinct | 314263 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 239855 |
| Missing (%) | 11.1% |
| Memory size | 154.1 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 22.72621 |
| Min length | 10 |
Unique
| Unique | 178274 |
|---|---|
| Unique (%) | 9.2% |
Sample
| 1st row | (40.62179, -73.970024) |
|---|---|
| 2nd row | (40.667202, -73.8665) |
| 3rd row | (40.683304, -73.917274) |
| 4th row | (40.709183, -73.956825) |
| 5th row | (40.86816, -73.83148) |
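The sample rows show LOCATION stored as text of the form "(latitude, longitude)". A minimal sketch of how such a string could be split into numeric coordinates is given below; `parse_location` is a hypothetical helper and assumes every non-missing value follows the format seen in the samples.

```python
def parse_location(text):
    """Split a '(lat, lon)' string into a pair of floats."""
    # Drop surrounding whitespace and parentheses, then split on the comma.
    lat_str, lon_str = text.strip().strip("()").split(",")
    return float(lat_str), float(lon_str)


# First sample row from the table above.
lat, lon = parse_location("(40.62179, -73.970024)")
print(lat, lon)
```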
| Value | Count | Frequency (%) |
| 0.0 | 10692 | 0.3% |
| 40.861862 | 918 | < 0.1% |
| 40.696033 | 803 | < 0.1% |
| 73.89063 | 790 | < 0.1% |
| 74.038086 | 740 | < 0.1% |
| 40.608757 | 739 | < 0.1% |
| 73.91282 | 720 | < 0.1% |
| 73.98453 | 718 | < 0.1% |
| 40.8047 | 702 | < 0.1% |
| 73.89686 | 685 | < 0.1% |
| Other values (228402) | 3842157 | 99.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 4800580 | 10.9% |
| 4 | 4162714 | 9.5% |
| . | 3859664 | 8.8% |
| 3 | 3654090 | 8.3% |
| 0 | 3556631 | 8.1% |
| 9 | 2813709 | 6.4% |
| 8 | 2765641 | 6.3% |
| 6 | 2734950 | 6.2% |
| 5 | 2187840 | 5.0% |
| ) | 1929832 | 4.4% |
| Other values (6) | 11392116 | 26.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 43857767 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 7 | 4800580 | 10.9% |
| 4 | 4162714 | 9.5% |
| . | 3859664 | 8.8% |
| 3 | 3654090 | 8.3% |
| 0 | 3556631 | 8.1% |
| 9 | 2813709 | 6.4% |
| 8 | 2765641 | 6.3% |
| 6 | 2734950 | 6.2% |
| 5 | 2187840 | 5.0% |
| ) | 1929832 | 4.4% |
| Other values (6) | 11392116 | 26.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 43857767 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 7 | 4800580 | 10.9% |
| 4 | 4162714 | 9.5% |
| . | 3859664 | 8.8% |
| 3 | 3654090 | 8.3% |
| 0 | 3556631 | 8.1% |
| 9 | 2813709 | 6.4% |
| 8 | 2765641 | 6.3% |
| 6 | 2734950 | 6.2% |
| 5 | 2187840 | 5.0% |
| ) | 1929832 | 4.4% |
| Other values (6) | 11392116 | 26.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 43857767 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 7 | 4800580 | 10.9% |
| 4 | 4162714 | 9.5% |
| . | 3859664 | 8.8% |
| 3 | 3654090 | 8.3% |
| 0 | 3556631 | 8.1% |
| 9 | 2813709 | 6.4% |
| 8 | 2765641 | 6.3% |
| 6 | 2734950 | 6.2% |
| 5 | 2187840 | 5.0% |
| ) | 1929832 | 4.4% |
| Other values (6) | 11392116 | 26.0% |
ON STREET NAME
Text
Missing 
| Distinct | 21730 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 467859 |
| Missing (%) | 21.6% |
| Memory size | 153.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 28.981801 |
| Min length | 2 |
Unique
| Unique | 7668 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | WHITESTONE EXPRESSWAY |
|---|---|
| 2nd row | QUEENSBORO BRIDGE UPPER |
| 3rd row | OCEAN PARKWAY |
| 4th row | THROGS NECK BRIDGE |
| 5th row | BROOKLYN BRIDGE |
| Value | Count | Frequency (%) |
| avenue | 622661 | 15.8% |
| street | 532561 | 13.6% |
| east | 156789 | 4.0% |
| boulevard | 129814 | 3.3% |
| west | 117287 | 3.0% |
| parkway | 78412 | 2.0% |
| road | 69656 | 1.8% |
| expressway | 67146 | 1.7% |
| island | 32291 | 0.8% |
| queens | 28427 | 0.7% |
| Other values (5459) | 2093501 | 53.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 27652057 | 56.1% |
| E | 3792469 | 7.7% |
| A | 2027021 | 4.1% |
| T | 1890880 | 3.8% |
| R | 1731520 | 3.5% |
| N | 1478967 | 3.0% |
| S | 1463241 | 3.0% |
| U | 1005240 | 2.0% |
| O | 902527 | 1.8% |
| V | 885702 | 1.8% |
| Other values (65) | 6492417 | 13.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 49322041 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| (space) | 27652057 | 56.1% |
| E | 3792469 | 7.7% |
| A | 2027021 | 4.1% |
| T | 1890880 | 3.8% |
| R | 1731520 | 3.5% |
| N | 1478967 | 3.0% |
| S | 1463241 | 3.0% |
| U | 1005240 | 2.0% |
| O | 902527 | 1.8% |
| V | 885702 | 1.8% |
| Other values (65) | 6492417 | 13.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 49322041 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| (space) | 27652057 | 56.1% |
| E | 3792469 | 7.7% |
| A | 2027021 | 4.1% |
| T | 1890880 | 3.8% |
| R | 1731520 | 3.5% |
| N | 1478967 | 3.0% |
| S | 1463241 | 3.0% |
| U | 1005240 | 2.0% |
| O | 902527 | 1.8% |
| V | 885702 | 1.8% |
| Other values (65) | 6492417 | 13.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 49322041 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| (space) | 27652057 | 56.1% |
| E | 3792469 | 7.7% |
| A | 2027021 | 4.1% |
| T | 1890880 | 3.8% |
| R | 1731520 | 3.5% |
| N | 1478967 | 3.0% |
| S | 1463241 | 3.0% |
| U | 1005240 | 2.0% |
| O | 902527 | 1.8% |
| V | 885702 | 1.8% |
| Other values (65) | 6492417 | 13.2% |
CROSS STREET NAME
Text
Missing 
| Distinct | 23740 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 827830 |
| Missing (%) | 38.2% |
| Memory size | 126.7 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 31 |
| Mean length | 22.28578 |
| Min length | 1 |
Unique
| Unique | 7263 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 20 AVENUE |
|---|---|
| 2nd row | AVENUE K |
| 3rd row | HENRY HUDSON RIVER |
| 4th row | DECATUR STREET |
| 5th row | EAST 43 STREET |
| Value | Count | Frequency (%) |
| avenue | 578077 | 19.5% |
| street | 468679 | 15.8% |
| east | 114406 | 3.9% |
| west | 72276 | 2.4% |
| boulevard | 70450 | 2.4% |
| road | 56811 | 1.9% |
| place | 34629 | 1.2% |
| parkway | 27456 | 0.9% |
| 3 | 19477 | 0.7% |
| park | 18027 | 0.6% |
| Other values (5590) | 1508188 | 50.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 14177815 | 47.4% |
| E | 3023830 | 10.1% |
| T | 1497841 | 5.0% |
| A | 1471873 | 4.9% |
| R | 1183523 | 4.0% |
| N | 1109613 | 3.7% |
| S | 1023762 | 3.4% |
| U | 797667 | 2.7% |
| V | 737444 | 2.5% |
| O | 600727 | 2.0% |
| Other values (66) | 4280235 | 14.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29904330 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| (space) | 14177815 | 47.4% |
| E | 3023830 | 10.1% |
| T | 1497841 | 5.0% |
| A | 1471873 | 4.9% |
| R | 1183523 | 4.0% |
| N | 1109613 | 3.7% |
| S | 1023762 | 3.4% |
| U | 797667 | 2.7% |
| V | 737444 | 2.5% |
| O | 600727 | 2.0% |
| Other values (66) | 4280235 | 14.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29904330 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| (space) | 14177815 | 47.4% |
| E | 3023830 | 10.1% |
| T | 1497841 | 5.0% |
| A | 1471873 | 4.9% |
| R | 1183523 | 4.0% |
| N | 1109613 | 3.7% |
| S | 1023762 | 3.4% |
| U | 797667 | 2.7% |
| V | 737444 | 2.5% |
| O | 600727 | 2.0% |
| Other values (66) | 4280235 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29904330 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| (space) | 14177815 | 47.4% |
| E | 3023830 | 10.1% |
| T | 1497841 | 5.0% |
| A | 1471873 | 4.9% |
| R | 1183523 | 4.0% |
| N | 1109613 | 3.7% |
| S | 1023762 | 3.4% |
| U | 797667 | 2.7% |
| V | 737444 | 2.5% |
| O | 600727 | 2.0% |
| Other values (66) | 4280235 | 14.3% |
OFF STREET NAME
Text
Missing 
| Distinct | 246307 |
|---|---|
| Distinct (%) | 65.6% |
| Missing | 1794202 |
| Missing (%) | 82.7% |
| Memory size | 87.7 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 34.983336 |
| Min length | 8 |
Unique
| Unique | 193285 |
|---|---|
| Unique (%) | 51.5% |
Sample
| 1st row | 61 Ed Koch queensborough bridge |
|---|---|
| 2nd row | 1211 LORING AVENUE |
| 3rd row | 344 BAYCHESTER AVENUE |
| 4th row | 2047 PITKIN AVENUE |
| 5th row | 480 DEAN STREET |
| Value | Count | Frequency (%) |
| avenue | 144519 | 11.6% |
| street | 132144 | 10.6% |
| east | 34848 | 2.8% |
| west | 25213 | 2.0% |
| boulevard | 22997 | 1.8% |
| road | 17137 | 1.4% |
| ave | 8546 | 0.7% |
| lot | 7881 | 0.6% |
| st | 7314 | 0.6% |
| parking | 7267 | 0.6% |
| Other values (28055) | 839100 | 67.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 7070993 | 53.8% |
| E | 845169 | 6.4% |
| T | 464863 | 3.5% |
| A | 435893 | 3.3% |
| R | 359949 | 2.7% |
| N | 316356 | 2.4% |
| S | 306999 | 2.3% |
| 1 | 299444 | 2.3% |
| U | 213501 | 1.6% |
| V | 203631 | 1.6% |
| Other values (74) | 2618920 | 19.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 13135718 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| (space) | 7070993 | 53.8% |
| E | 845169 | 6.4% |
| T | 464863 | 3.5% |
| A | 435893 | 3.3% |
| R | 359949 | 2.7% |
| N | 316356 | 2.4% |
| S | 306999 | 2.3% |
| 1 | 299444 | 2.3% |
| U | 213501 | 1.6% |
| V | 203631 | 1.6% |
| Other values (74) | 2618920 | 19.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 13135718 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| (space) | 7070993 | 53.8% |
| E | 845169 | 6.4% |
| T | 464863 | 3.5% |
| A | 435893 | 3.3% |
| R | 359949 | 2.7% |
| N | 316356 | 2.4% |
| S | 306999 | 2.3% |
| 1 | 299444 | 2.3% |
| U | 213501 | 1.6% |
| V | 203631 | 1.6% |
| Other values (74) | 2618920 | 19.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 13135718 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| (space) | 7070993 | 53.8% |
| E | 845169 | 6.4% |
| T | 464863 | 3.5% |
| A | 435893 | 3.3% |
| R | 359949 | 2.7% |
| N | 316356 | 2.4% |
| S | 306999 | 2.3% |
| 1 | 299444 | 2.3% |
| U | 213501 | 1.6% |
| V | 203631 | 1.6% |
| Other values (74) | 2618920 | 19.9% |
NUMBER OF PERSONS INJURED
Real number (ℝ)
High correlation  Zeros 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.32193113 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1654274 |
| Zeros (%) | 76.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.70984503 |
|---|---|
| Coefficient of variation (CV) | 2.2049593 |
| Kurtosis | 47.914741 |
| Mean | 0.32193113 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.1348047 |
| Sum | 698484 |
| Variance | 0.50387997 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1654274 | 76.2% |
| 1 | 400107 | 18.4% |
| 2 | 75273 | 3.5% |
| 3 | 24667 | 1.1% |
| 4 | 9105 | 0.4% |
| 5 | 3498 | 0.2% |
| 6 | 1455 | 0.1% |
| 7 | 608 | < 0.1% |
| 8 | 273 | < 0.1% |
| 9 | 137 | < 0.1% |
| Other values (22) | 272 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1654274 | 76.2% |
| 1 | 400107 | 18.4% |
| 2 | 75273 | 3.5% |
| 3 | 24667 | 1.1% |
| 4 | 9105 | 0.4% |
| 5 | 3498 | 0.2% |
| 6 | 1455 | 0.1% |
| 7 | 608 | < 0.1% |
| 8 | 273 | < 0.1% |
| 9 | 137 | < 0.1% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 3 | < 0.1% |
NUMBER OF PERSONS KILLED
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0015532416 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 2166423 |
| Zeros (%) | 99.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.041600263 |
|---|---|
| Coefficient of variation (CV) | 26.782866 |
| Kurtosis | 1822.8072 |
| Mean | 0.0015532416 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 32.98672 |
| Sum | 3370 |
| Variance | 0.0017305819 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2166423 | 99.8% |
| 1 | 3129 | 0.1% |
| 2 | 84 | < 0.1% |
| 3 | 13 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| (Missing) | 31 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2166423 | 99.8% |
| 1 | 3129 | 0.1% |
| 2 | 84 | < 0.1% |
| 3 | 13 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 4 | < 0.1% |
| 3 | 13 | < 0.1% |
| 2 | 84 | < 0.1% |
| 1 | 3129 | 0.1% |
| 0 | 2166423 | 99.8% |
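The mean, variance, and skewness reported for NUMBER OF PERSONS KILLED can be reproduced from the frequency table alone. The sketch below uses plain population moments, which is an assumption about the estimator: the report may apply a sample-size correction, but with more than two million rows the difference is negligible at the precision shown.

```python
# Value -> count, copied from the frequency table above (31 missing rows excluded).
counts = {0: 2_166_423, 1: 3_129, 2: 84, 3: 13, 4: 4, 5: 2, 8: 1}

n = sum(counts.values())
mean = sum(value * count for value, count in counts.items()) / n


def central_moment(k):
    """k-th central moment of the distribution described by `counts`."""
    return sum(count * (value - mean) ** k for value, count in counts.items()) / n


variance = central_moment(2)
# Skewness (gamma_1) is the third central moment scaled by sigma cubed.
skewness = central_moment(3) / variance ** 1.5

print(f"n={n}, mean={mean:.10f}, variance={variance:.10f}, skewness={skewness:.4f}")
```

The skewness is large because almost all the mass sits at zero while a handful of rows reach values up to 8.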
NUMBER OF PEDESTRIANS INJURED
Real number (ℝ)
Zeros 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.058721373 |
| Minimum | 0 |
|---|---|
| Maximum | 27 |
| Zeros | 2047534 |
| Zeros (%) | 94.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 27 |
| Range | 27 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.24833533 |
|---|---|
| Coefficient of variation (CV) | 4.2290451 |
| Kurtosis | 117.21302 |
| Mean | 0.058721373 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.4971306 |
| Sum | 127407 |
| Variance | 0.061670438 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2047534 | 94.4% |
| 1 | 117672 | 5.4% |
| 2 | 3968 | 0.2% |
| 3 | 396 | < 0.1% |
| 4 | 65 | < 0.1% |
| 5 | 27 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 6 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2047534 | 94.4% |
| 1 | 117672 | 5.4% |
| 2 | 3968 | 0.2% |
| 3 | 396 | < 0.1% |
| 4 | 65 | < 0.1% |
| 5 | 27 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 6 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 27 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 6 | < 0.1% |
| 6 | 11 | < 0.1% |
| 5 | 27 | < 0.1% |
| 4 | 65 | < 0.1% |
NUMBER OF PEDESTRIANS KILLED
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.00077061807 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 2168039 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.02837345 |
|---|---|
| Coefficient of variation (CV) | 36.819083 |
| Kurtosis | 2472.9705 |
| Mean | 0.00077061807 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 41.282348 |
| Sum | 1672 |
| Variance | 0.00080505268 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2168039 | 99.9% |
| 1 | 1631 | 0.1% |
| 2 | 14 | < 0.1% |
| 4 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2168039 | 99.9% |
| 1 | 1631 | 0.1% |
| 2 | 14 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 2 | 14 | < 0.1% |
| 1 | 1631 | 0.1% |
| 0 | 2168039 | 99.9% |
NUMBER OF CYCLIST INJURED
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 120.0 MiB |
| 0 | 2109480 |
|---|---|
| 1 | 59495 |
| 2 | 686 |
| 3 | 25 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2109480 | 97.2% |
| 1 | 59495 | 2.7% |
| 2 | 686 | < 0.1% |
| 3 | 25 | < 0.1% |
| 4 | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| 0 | 2109480 | 97.2% |
| 1 | 59495 | 2.7% |
| 2 | 686 | < 0.1% |
| 3 | 25 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2109480 | 97.2% |
| 1 | 59495 | 2.7% |
| 2 | 686 | < 0.1% |
| 3 | 25 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2169687 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2109480 | 97.2% |
| 1 | 59495 | 2.7% |
| 2 | 686 | < 0.1% |
| 3 | 25 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2169687 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2109480 | 97.2% |
| 1 | 59495 | 2.7% |
| 2 | 686 | < 0.1% |
| 3 | 25 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2169687 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2109480 | 97.2% |
| 1 | 59495 | 2.7% |
| 2 | 686 | < 0.1% |
| 3 | 25 | < 0.1% |
| 4 | 1 | < 0.1% |
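The 92.0% imbalance figure in the alert for NUMBER OF CYCLIST INJURED is consistent with a normalised-entropy score: one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy for that number of classes. The sketch below assumes that definition; `imbalance_score` is an illustrative helper, not a function taken from the profiling library.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class shares.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log(p) for p in shares)
    max_entropy = math.log(len(counts))
    return 1 - entropy / max_entropy


# Counts for the values 0 to 4, copied from the Common Values table above.
cyclist_injured = [2_109_480, 59_495, 686, 25, 1]
score = imbalance_score(cyclist_injured)

print(f"{100 * score:.1f}%")
```

A perfectly balanced column would score 0 under this definition, so a value near 1 signals that a single category (here, zero injured cyclists) accounts for nearly every row.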
NUMBER OF CYCLIST KILLED
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 120.0 MiB |
| 0 | 2169424 |
|---|---|
| 1 | 262 |
| 2 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2169424 | > 99.9% |
| 1 | 262 | < 0.1% |
| 2 | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| 0 | 2169424 | > 99.9% |
| 1 | 262 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2169424 | > 99.9% |
| 1 | 262 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2169687 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2169424 | > 99.9% |
| 1 | 262 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2169687 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2169424 | > 99.9% |
| 1 | 262 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2169687 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2169424 | > 99.9% |
| 1 | 262 | < 0.1% |
| 2 | 1 | < 0.1% |
NUMBER OF MOTORIST INJURED
Real number (ℝ)
High correlation  Zeros 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.23081209 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1842231 |
| Zeros (%) | 84.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.67103572 |
|---|---|
| Coefficient of variation (CV) | 2.9072815 |
| Kurtosis | 59.571808 |
| Mean | 0.23081209 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.9889082 |
| Sum | 500790 |
| Variance | 0.45028894 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1842231 | 84.9% |
| 1 | 220044 | 10.1% |
| 2 | 68504 | 3.2% |
| 3 | 23905 | 1.1% |
| 4 | 8916 | 0.4% |
| 5 | 3444 | 0.2% |
| 6 | 1407 | 0.1% |
| 7 | 581 | < 0.1% |
| 8 | 263 | < 0.1% |
| 9 | 131 | < 0.1% |
| Other values (21) | 261 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1842231 | 84.9% |
| 1 | 220044 | 10.1% |
| 2 | 68504 | 3.2% |
| 3 | 23905 | 1.1% |
| 4 | 8916 | 0.4% |
| 5 | 3444 | 0.2% |
| 6 | 1407 | 0.1% |
| 7 | 581 | < 0.1% |
| 8 | 263 | < 0.1% |
| 9 | 131 | < 0.1% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 2 | < 0.1% |
| 21 | 1 | < 0.1% |
NUMBER OF MOTORIST KILLED
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.00063281017 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 2168418 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.027494185 |
|---|---|
| Coefficient of variation (CV) | 43.44776 |
| Kurtosis | 4006.357 |
| Mean | 0.00063281017 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 53.530055 |
| Sum | 1373 |
| Variance | 0.00075593019 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2168418 | 99.9% |
| 1 | 1187 | 0.1% |
| 2 | 66 | < 0.1% |
| 3 | 12 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2168418 | |
| 1 | 1187 | 0.1% |
| 2 | 66 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 3 | 12 | < 0.1% |
| 2 | 66 | < 0.1% |
| 1 | 1187 | 0.1% |
| 0 | 2168418 |
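The summary figures for NUMBER OF MOTORIST KILLED can be cross-checked against its value counts. The sketch below uses plain Python to rebuild the sum, mean, variance, standard deviation and coefficient of variation from the six (value, count) pairs listed for this variable. It assumes the report uses the sample variance (denominator n - 1), which is the pandas default; the variable names are illustrative and not part of the report.

```python
import math

# (value, count) pairs copied from the frequency table for this variable.
value_counts = {0: 2168418, 1: 1187, 2: 66, 3: 12, 4: 2, 5: 2}

n = sum(value_counts.values())                        # number of observations
total = sum(v * c for v, c in value_counts.items())   # "Sum" in the report
mean = total / n

# Sample variance (ddof=1); this convention is an assumption about the report.
sum_sq_dev = sum(c * (v - mean) ** 2 for v, c in value_counts.items())
variance = sum_sq_dev / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"n={n} sum={total} mean={mean:.11f}")
print(f"variance={variance:.11f} std={std:.9f} cv={cv:.5f}")
```

Under that assumption the recomputed values agree with the Descriptive statistics rows to the precision shown there, which suggests the table and the counts describe the same column.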
CONTRIBUTING FACTOR VEHICLE 1
Text
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7488 |
| Missing (%) | 0.3% |
| Memory size | 158.1 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 43 |
| Mean length | 19.580861 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aggressive Driving/Road Rage |
|---|---|
| 2nd row | Pavement Slippery |
| 3rd row | Unspecified |
| 4th row | Following Too Closely |
| 5th row | Passing Too Closely |
| Value | Count | Frequency (%) |
|---|---|---|
| unspecified | 730368 | 16.9% |
| driver | 472824 | 10.9% |
| inattention/distraction | 438290 | 10.1% |
| closely | 171246 | 4.0% |
| too | 171246 | 4.0% |
| to | 155546 | 3.6% |
| failure | 136141 | 3.1% |
| yield | 129654 | 3.0% |
| right-of-way | 129654 | 3.0% |
| passing | 116503 | 2.7% |
| Other values (96) | 1674831 | 38.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4751528 | 11.2% |
| e | 4307016 | 10.2% |
| n | 3682645 | 8.7% |
| t | 2947416 | 7.0% |
| o | 2505340 | 5.9% |
| r | 2498947 | 5.9% |
| s | 2193313 | 5.2% |
| (space) | 2164104 | 5.1% |
| a | 2095204 | 4.9% |
| c | 1621715 | 3.8% |
| Other values (45) | 13570491 | 32.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 42337719 | 100.0% |
CONTRIBUTING FACTOR VEHICLE 2
Text
Missing 
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 344595 |
| Missing (%) | 15.9% |
| Memory size | 132.5 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 13.056416 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 1536370 | |
| driver | 105403 | 4.7% |
| inattention/distraction | 98376 | 4.4% |
| other | 34374 | 1.5% |
| vehicular | 33308 | 1.5% |
| too | 29279 | 1.3% |
| closely | 29279 | 1.3% |
| passing | 22633 | 1.0% |
| to | 22273 | 1.0% |
| lane | 21043 | 0.9% |
| Other values (96) | 308004 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3754742 | |
| e | 3656072 | |
| n | 2136161 | |
| s | 1829805 | |
| c | 1734021 | |
| d | 1613337 | |
| p | 1609180 | |
| f | 1595364 | |
| U | 1574713 | |
| t | 645410 | 2.7% |
| Other values (45) | 3680355 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 23829160 |
CONTRIBUTING FACTOR VEHICLE 3
Text
Missing 
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2013111 |
| Missing (%) | 92.8% |
| Memory size | 71.7 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 11.662183 |
| Min length | 1 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 145876 | |
| other | 3048 | 1.8% |
| vehicular | 3008 | 1.8% |
| driver | 2275 | 1.3% |
| closely | 2148 | 1.3% |
| too | 2148 | 1.3% |
| following | 2089 | 1.2% |
| inattention/distraction | 2079 | 1.2% |
| fatigued/drowsy | 855 | 0.5% |
| pavement | 434 | 0.3% |
| Other values (82) | 6286 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 311985 | |
| i | 310458 | |
| n | 160122 | |
| s | 153239 | |
| c | 152712 | |
| d | 148071 | |
| p | 147649 | |
| f | 146846 | |
| U | 146602 | |
| o | 18344 | 1.0% |
| Other values (45) | 129990 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1826018 |
CONTRIBUTING FACTOR VEHICLE 4
Categorical
High correlation  Imbalance  Missing 
| Distinct | 43 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2133997 |
| Missing (%) | 98.4% |
| Memory size | 132.6 MiB |
| Unspecified | 33655 |
|---|---|
| Other Vehicular | 678 |
| Following Too Closely | 413 |
| Driver Inattention/Distraction | 294 |
| Fatigued/Drowsy | 170 |
| Other values (38) | 480 |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.491734 |
| Min length | 5 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
Common Values
| Value | Count | Frequency (%) |
| Unspecified | 33655 | 1.6% |
| Other Vehicular | 678 | < 0.1% |
| Following Too Closely | 413 | < 0.1% |
| Driver Inattention/Distraction | 294 | < 0.1% |
| Fatigued/Drowsy | 170 | < 0.1% |
| Pavement Slippery | 125 | < 0.1% |
| Reaction to Uninvolved Vehicle | 44 | < 0.1% |
| Unsafe Speed | 34 | < 0.1% |
| Driver Inexperience | 31 | < 0.1% |
| Outside Car Distraction | 31 | < 0.1% |
| Other values (33) | 215 | < 0.1% |
| (Missing) | 2133997 | 98.4% |
Length
| Value | Count | Frequency (%) |
| unspecified | 33655 | |
| other | 687 | 1.8% |
| vehicular | 678 | 1.8% |
| too | 418 | 1.1% |
| closely | 418 | 1.1% |
| following | 413 | 1.1% |
| driver | 325 | 0.9% |
| inattention/distraction | 294 | 0.8% |
| fatigued/drowsy | 170 | 0.4% |
| pavement | 129 | 0.3% |
| Other values (67) | 1035 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 71167 | |
| i | 70477 | |
| n | 35875 | |
| c | 34920 | |
| s | 34868 | |
| p | 34047 | |
| d | 34019 | |
| f | 33788 | |
| U | 33771 | |
| o | 3254 | 0.8% |
| Other values (42) | 23954 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 410140 |
CONTRIBUTING FACTOR VEHICLE 5
Categorical
High correlation  Imbalance  Missing 
| Distinct | 32 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2159928 |
| Missing (%) | 99.6% |
| Memory size | 132.5 MiB |
| Unspecified | 9199 |
|---|---|
| Other Vehicular | 197 |
| Following Too Closely | 107 |
| Driver Inattention/Distraction | 68 |
| Pavement Slippery | 53 |
| Other values (27) | 135 |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.467056 |
| Min length | 5 |
Unique
| Unique | 12 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
Common Values
| Value | Count | Frequency (%) |
| Unspecified | 9199 | 0.4% |
| Other Vehicular | 197 | < 0.1% |
| Following Too Closely | 107 | < 0.1% |
| Driver Inattention/Distraction | 68 | < 0.1% |
| Pavement Slippery | 53 | < 0.1% |
| Fatigued/Drowsy | 41 | < 0.1% |
| Reaction to Uninvolved Vehicle | 12 | < 0.1% |
| Obstruction/Debris | 11 | < 0.1% |
| Alcohol Involvement | 11 | < 0.1% |
| Driver Inexperience | 10 | < 0.1% |
| Other values (22) | 50 | < 0.1% |
| (Missing) | 2159928 | 99.6% |
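The Missing and Missing (%) rows of the five CONTRIBUTING FACTOR VEHICLE summaries all follow from one ratio: missing cells divided by the number of observations (2169687 in the dataset statistics). The sketch below recomputes the percentages from the counts printed in this report. The helper name `missing_pct` is hypothetical, and rounding to one decimal place is an assumption about how the report formats the figure.

```python
# Number of observations, from the report's dataset statistics.
N_ROWS = 2169687

# Missing-cell counts copied from the five variable summaries.
missing = {
    "CONTRIBUTING FACTOR VEHICLE 1": 7488,
    "CONTRIBUTING FACTOR VEHICLE 2": 344595,
    "CONTRIBUTING FACTOR VEHICLE 3": 2013111,
    "CONTRIBUTING FACTOR VEHICLE 4": 2133997,
    "CONTRIBUTING FACTOR VEHICLE 5": 2159928,
}


def missing_pct(count: int, n_rows: int = N_ROWS) -> float:
    """Share of missing cells as a percentage, rounded to one decimal."""
    return round(100 * count / n_rows, 1)


pct = {name: missing_pct(count) for name, count in missing.items()}
present = {name: N_ROWS - count for name, count in missing.items()}

for name in missing:
    print(f"{name}: {pct[name]}% missing, {present[name]} values present")
```

The `present` figure for vehicle 5 equals the total of the non-missing rows in its Common Values table, which is a quick way to confirm that the two tables agree.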
Length
| Value | Count | Frequency (%) |
| unspecified | 9199 | |
| other | 199 | 1.9% |
| vehicular | 197 | 1.9% |
| too | 109 | 1.0% |
| closely | 109 | 1.0% |
| following | 107 | 1.0% |
| driver | 78 | 0.7% |
| inattention/distraction | 68 | 0.7% |
| pavement | 54 | 0.5% |
| slippery | 53 | 0.5% |
| Other values (50) | 260 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 19487 | |
| i | 19225 | |
| n | 9758 | |
| c | 9546 | |
| s | 9489 | |
| p | 9333 | |
| d | 9285 | |
| f | 9227 | |
| U | 9222 | |
| o | 837 | 0.7% |
| Other values (41) | 6498 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 111907 |
COLLISION_ID
Real number (ℝ)
Unique 
| Distinct | 2169687 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3229115.9 |
| Minimum | 22 |
| Maximum | 4806433 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 22 |
|---|---|
| 5-th percentile | 109347.3 |
| Q1 | 3178541.5 |
| median | 3721118 |
| Q3 | 4263766.5 |
| 95-th percentile | 4697771.7 |
| Maximum | 4806433 |
| Range | 4806411 |
| Interquartile range (IQR) | 1085225 |
Descriptive statistics
| Standard deviation | 1507781.8 |
|---|---|
| Coefficient of variation (CV) | 0.46693332 |
| Kurtosis | 0.091475235 |
| Mean | 3229115.9 |
| Median Absolute Deviation (MAD) | 542613 |
| Skewness | -1.2508025 |
| Sum | 7.0061708 × 10^12 |
| Variance | 2.273406 × 10^12 |
| Monotonicity | Not monotonic |
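Several COLLISION_ID rows are derived from others, so they can be sanity-checked with simple arithmetic. The sketch below recomputes Range, IQR and CV from the reported minimum, maximum, quartiles, mean and standard deviation. It also approximates Sum and Variance, both on the order of 10^12; those two match only up to the rounding of the printed mean and standard deviation. The variable names are illustrative.

```python
# Figures copied from the COLLISION_ID summary tables.
n = 2169687
minimum, maximum = 22, 4806433
q1, q3 = 3178541.5, 4263766.5
mean, std = 3229115.9, 1507781.8

value_range = maximum - minimum   # "Range"
iqr = q3 - q1                     # "Interquartile range (IQR)"
cv = std / mean                   # "Coefficient of variation (CV)"
approx_sum = mean * n             # "Sum", up to rounding of the mean
approx_var = std ** 2             # "Variance", up to rounding of std

print(f"range={value_range} iqr={iqr} cv={cv:.8f}")
print(f"sum~{approx_sum:.7e} variance~{approx_var:.6e}")
```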
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 4806253 | 1 | < 0.1% |
| 4455765 | 1 | < 0.1% |
| 4513547 | 1 | < 0.1% |
| 4675373 | 1 | < 0.1% |
| 4541903 | 1 | < 0.1% |
| 4566131 | 1 | < 0.1% |
| 4623759 | 1 | < 0.1% |
| 4675709 | 1 | < 0.1% |
| 4675769 | 1 | < 0.1% |
| 4623865 | 1 | < 0.1% |
| Other values (2169677) | 2169677 | |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 22 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 4806433 | 1 | < 0.1% |
| 4806432 | 1 | < 0.1% |
| 4806429 | 1 | < 0.1% |
| 4806428 | 1 | < 0.1% |
| 4806426 | 1 | < 0.1% |
| 4806425 | 1 | < 0.1% |
| 4806423 | 1 | < 0.1% |
| 4806422 | 1 | < 0.1% |
| 4806409 | 1 | < 0.1% |
| 4806408 | 1 | < 0.1% |
VEHICLE TYPE CODE 1
Text
| Distinct | 1768 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 15354 |
| Missing (%) | 0.7% |
| Memory size | 152.2 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 35 |
| Mean length | 16.845762 |
| Min length | 1 |
Unique
| Unique | 1070 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Sedan |
| 3rd row | Moped |
| 4th row | Sedan |
| 5th row | Station Wagon/Sport Utility Vehicle |
| Value | Count | Frequency (%) |
| vehicle | 912569 | |
| utility | 666102 | |
| station | 666056 | |
| sedan | 661932 | |
| wagon/sport | 485764 | |
| passenger | 416225 | |
| (blank) | 181771 | 3.6% |
| wagon | 180357 | 3.6% |
| sport | 180291 | 3.6% |
| truck | 90588 | 1.8% |
| Other values (1020) | 636005 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2936547 | 8.1% |
| S | 2843291 | 7.8% |
| t | 2465415 | 6.8% |
| i | 2076116 | 5.7% |
| E | 1820373 | 5.0% |
| a | 1733286 | 4.8% |
| e | 1727059 | 4.8% |
| n | 1657129 | 4.6% |
| o | 1540424 | 4.2% |
| T | 1150164 | 3.2% |
| Other values (67) | 16341576 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 36291380 |
VEHICLE TYPE CODE 2
Text
Missing
| Distinct | 1979 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 428782 |
| Missing (%) | 19.8% |
| Memory size | 134.3 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 30 |
| Mean length | 16.034862 |
| Min length | 1 |
Unique
| Unique | 1186 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Sedan |
| 3rd row | Pick-up Truck |
| 4th row | Box Truck |
| 5th row | Station Wagon/Sport Utility Vehicle |
| Value | Count | Frequency (%) |
| vehicle | 672305 | |
| utility | 485333 | |
| station | 485301 | |
| sedan | 460130 | |
| wagon/sport | 345097 | |
| passenger | 318614 | |
| (blank) | 141646 | 3.6% |
| wagon | 140262 | 3.5% |
| sport | 140204 | 3.5% |
| truck | 90131 | 2.3% |
| Other values (1078) | 676997 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2228083 | 8.0% |
| S | 2093987 | 7.5% |
| t | 1762335 | 6.3% |
| i | 1515114 | 5.4% |
| E | 1440961 | 5.2% |
| e | 1263386 | 4.5% |
| a | 1231686 | 4.4% |
| n | 1170611 | 4.2% |
| o | 1124916 | 4.0% |
| T | 926688 | 3.3% |
| Other values (63) | 13157405 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 27915172 |
VEHICLE TYPE CODE 3
Text
Missing
| Distinct | 286 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2019068 |
| Missing (%) | 93.1% |
| Memory size | 72.3 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 17.66163 |
| Min length | 2 |
Unique
| Unique | 171 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Sedan |
| 3rd row | Station Wagon/Sport Utility Vehicle |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 67310 | |
| utility | 52522 | |
| station | 52519 | |
| sedan | 50913 | |
| wagon/sport | 39160 | |
| passenger | 27716 | |
| (blank) | 13450 | 3.7% |
| wagon | 13359 | 3.7% |
| sport | 13358 | 3.7% |
| truck | 4722 | 1.3% |
| Other values (231) | 29493 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 214338 | 8.1% |
| S | 210484 | 7.9% |
| t | 197342 | 7.4% |
| i | 163015 | 6.1% |
| a | 133085 | 5.0% |
| e | 132690 | 5.0% |
| n | 130214 | 4.9% |
| o | 120783 | 4.5% |
| E | 116444 | 4.4% |
| l | 79839 | 3.0% |
| Other values (52) | 1161943 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2660177 |
VEHICLE TYPE CODE 4
Text
Missing
| Distinct | 111 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2135279 |
| Missing (%) | 98.4% |
| Memory size | 67.6 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 18.02607 |
| Min length | 2 |
Unique
| Unique | 54 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Sedan |
| 3rd row | Station Wagon/Sport Utility Vehicle |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 15835 | |
| station | 12661 | |
| utility | 12661 | |
| sedan | 12388 | |
| wagon/sport | 9809 | |
| passenger | 5970 | 7.1% |
| (blank) | 2862 | 3.4% |
| sport | 2852 | 3.4% |
| wagon | 2852 | 3.4% |
| truck | 868 | 1.0% |
| Other values (110) | 5231 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 49637 | 8.0% |
| S | 49284 | 7.9% |
| t | 49284 | 7.9% |
| i | 40437 | 6.5% |
| a | 32723 | 5.3% |
| e | 32519 | 5.2% |
| n | 32170 | 5.2% |
| o | 29952 | 4.8% |
| E | 24673 | 4.0% |
| l | 19876 | 3.2% |
| Other values (48) | 259686 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 620241 |
VEHICLE TYPE CODE 5
Text
Missing
| Distinct | 75 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 2160231 |
| Missing (%) | 99.6% |
| Memory size | 66.6 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 18.181684 |
| Min length | 2 |
Unique
| Unique | 35 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Station Wagon/Sport Utility Vehicle |
| Value | Count | Frequency (%) |
| vehicle | 4294 | |
| utility | 3600 | |
| station | 3600 | |
| sedan | 3516 | |
| wagon/sport | 2798 | |
| passenger | 1487 | 6.4% |
| (blank) | 804 | 3.5% |
| wagon | 804 | 3.5% |
| sport | 802 | 3.5% |
| truck | 272 | 1.2% |
| Other values (75) | 1257 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 14069 | 8.2% |
| (space) | 13788 | 8.0% |
| S | 13608 | 7.9% |
| i | 11540 | 6.7% |
| a | 9308 | 5.4% |
| e | 9257 | 5.4% |
| n | 9176 | 5.3% |
| o | 8568 | 5.0% |
| E | 6130 | 3.6% |
| l | 5672 | 3.3% |
| Other values (46) | 70810 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 171926 |
Interactions
Correlations
| | BOROUGH | COLLISION_ID | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | LATITUDE | LONGITUDE | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BOROUGH | 1.000 | 0.054 | 0.050 | 0.045 | 0.006 | 0.006 | 0.028 | 0.001 | 0.008 | 0.004 | 0.002 | 0.000 | 0.008 | 0.002 |
| COLLISION_ID | 0.054 | 1.000 | 0.067 | 0.078 | -0.016 | 0.068 | 0.039 | 0.004 | 0.117 | 0.008 | 0.039 | 0.005 | 0.153 | 0.011 |
| CONTRIBUTING FACTOR VEHICLE 4 | 0.050 | 0.067 | 1.000 | 0.690 | 0.000 | 0.000 | 0.000 | 0.000 | 0.022 | 0.000 | 0.143 | 0.000 | 0.025 | 0.000 |
| CONTRIBUTING FACTOR VEHICLE 5 | 0.045 | 0.078 | 0.690 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.040 | 0.000 | 0.000 | 0.000 | 0.038 | 0.000 |
| LATITUDE | 0.006 | -0.016 | 0.000 | 0.000 | 1.000 | 0.285 | 0.003 | 0.000 | -0.032 | -0.001 | 0.003 | -0.001 | -0.026 | -0.001 |
| LONGITUDE | 0.006 | 0.068 | 0.000 | 0.000 | 0.285 | 1.000 | 0.002 | 0.000 | 0.075 | 0.006 | -0.014 | 0.001 | 0.039 | 0.003 |
| NUMBER OF CYCLIST INJURED | 0.028 | 0.039 | 0.000 | 0.000 | 0.003 | 0.002 | 1.000 | 0.018 | 0.004 | 0.001 | 0.000 | 0.002 | 0.004 | 0.005 |
| NUMBER OF CYCLIST KILLED | 0.001 | 0.004 | 0.000 | 0.000 | 0.000 | 0.000 | 0.018 | 1.000 | 0.000 | 0.000 | 0.162 | 0.707 | 0.040 | 0.736 |
| NUMBER OF MOTORIST INJURED | 0.008 | 0.117 | 0.022 | 0.040 | -0.032 | 0.075 | 0.004 | 0.000 | 1.000 | 0.018 | -0.092 | -0.003 | 0.781 | 0.008 |
| NUMBER OF MOTORIST KILLED | 0.004 | 0.008 | 0.000 | 0.000 | -0.001 | 0.006 | 0.001 | 0.000 | 0.018 | 1.000 | -0.004 | 0.003 | 0.012 | 0.623 |
| NUMBER OF PEDESTRIANS INJURED | 0.002 | 0.039 | 0.143 | 0.000 | 0.003 | -0.014 | 0.000 | 0.162 | -0.092 | -0.004 | 1.000 | 0.002 | 0.412 | -0.002 |
| NUMBER OF PEDESTRIANS KILLED | 0.000 | 0.005 | 0.000 | 0.000 | -0.001 | 0.001 | 0.002 | 0.707 | -0.003 | 0.003 | 0.002 | 1.000 | -0.005 | 0.714 |
| NUMBER OF PERSONS INJURED | 0.008 | 0.153 | 0.025 | 0.038 | -0.026 | 0.039 | 0.004 | 0.040 | 0.781 | 0.012 | 0.412 | -0.005 | 1.000 | 0.003 |
| NUMBER OF PERSONS KILLED | 0.002 | 0.011 | 0.000 | 0.000 | -0.001 | 0.003 | 0.005 | 0.736 | 0.008 | 0.623 | -0.002 | 0.714 | 0.003 | 1.000 |
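The High correlation alerts at the top of the report can be reproduced from this matrix by thresholding the absolute coefficients. The sketch below does so for the four "KILLED" columns, using values copied from the rows above. The 0.5 cut-off is an assumption: it is consistent with the alerts shown (0.623 is flagged, 0.412 is not), but the report does not state the threshold it used. The function and variable names are hypothetical.

```python
# Assumed alert threshold; the report does not state the value it applies.
THRESHOLD = 0.5

# Upper-triangle coefficients copied from the correlation matrix.
corr = {
    ("NUMBER OF CYCLIST KILLED", "NUMBER OF MOTORIST KILLED"): 0.000,
    ("NUMBER OF CYCLIST KILLED", "NUMBER OF PEDESTRIANS KILLED"): 0.707,
    ("NUMBER OF CYCLIST KILLED", "NUMBER OF PERSONS KILLED"): 0.736,
    ("NUMBER OF MOTORIST KILLED", "NUMBER OF PEDESTRIANS KILLED"): 0.003,
    ("NUMBER OF MOTORIST KILLED", "NUMBER OF PERSONS KILLED"): 0.623,
    ("NUMBER OF PEDESTRIANS KILLED", "NUMBER OF PERSONS KILLED"): 0.714,
}


def high_pairs(coefficients, threshold=THRESHOLD):
    """Return the column pairs whose absolute coefficient meets the threshold."""
    return sorted(pair for pair, r in coefficients.items() if abs(r) >= threshold)


# Build "X is highly correlated with ..." lists, one per column.
partners = {}
for a, b in high_pairs(corr):
    partners.setdefault(a, []).append(b)
    partners.setdefault(b, []).append(a)

for column, others in sorted(partners.items()):
    print(f"{column}: {', '.join(others)}")
```

With these inputs NUMBER OF PERSONS KILLED ends up with three partners and NUMBER OF MOTORIST KILLED with one, which matches the wording of the corresponding alerts.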
Missing values
Sample
First rows
| | CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 09/11/2021 | 2:39 | NaN | NaN | NaN | NaN | NaN | WHITESTONE EXPRESSWAY | 20 AVENUE | NaN | 2.0 | 0.0 | 0 | 0 | 0 | 0 | 2 | 0 | Aggressive Driving/Road Rage | Unspecified | NaN | NaN | NaN | 4455765 | Sedan | Sedan | NaN | NaN | NaN |
| 1 | 03/26/2022 | 11:45 | NaN | NaN | NaN | NaN | NaN | QUEENSBORO BRIDGE UPPER | NaN | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Pavement Slippery | NaN | NaN | NaN | NaN | 4513547 | Sedan | NaN | NaN | NaN | NaN |
| 2 | 11/01/2023 | 1:29 | BROOKLYN | 11230.0 | 40.621790 | -73.970024 | (40.62179, -73.970024) | OCEAN PARKWAY | AVENUE K | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Unspecified | Unspecified | Unspecified | NaN | NaN | 4675373 | Moped | Sedan | Sedan | NaN | NaN |
| 3 | 06/29/2022 | 6:55 | NaN | NaN | NaN | NaN | NaN | THROGS NECK BRIDGE | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Following Too Closely | Unspecified | NaN | NaN | NaN | 4541903 | Sedan | Pick-up Truck | NaN | NaN | NaN |
| 4 | 09/21/2022 | 13:21 | NaN | NaN | NaN | NaN | NaN | BROOKLYN BRIDGE | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | Unspecified | NaN | NaN | NaN | 4566131 | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN | NaN |
| 5 | 04/26/2023 | 13:30 | NaN | NaN | NaN | NaN | NaN | WEST 54 STREET | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4623759 | Sedan | Box Truck | NaN | NaN | NaN |
| 6 | 11/01/2023 | 7:12 | NaN | NaN | NaN | NaN | NaN | HUTCHINSON RIVER PARKWAY | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Following Too Closely | Driver Inattention/Distraction | NaN | NaN | NaN | 4675709 | Sedan | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 7 | 11/01/2023 | 8:01 | NaN | NaN | NaN | NaN | NaN | WEST 35 STREET | HENRY HUDSON RIVER | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Failure to Yield Right-of-Way | NaN | NaN | NaN | NaN | 4675769 | Sedan | NaN | NaN | NaN | NaN |
| 8 | 04/26/2023 | 22:20 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 61 Ed Koch queensborough bridge | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4623865 | Sedan | Pick-up Truck | NaN | NaN | NaN |
| 9 | 09/11/2021 | 9:35 | BROOKLYN | 11208.0 | 40.667202 | -73.866500 | (40.667202, -73.8665) | NaN | NaN | 1211 LORING AVENUE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4456314 | Sedan | NaN | NaN | NaN | NaN |
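The sample rows show CRASH DATE and CRASH TIME as separate strings. A value such as 03/26/2022 only makes sense as month/day/year, and times such as 22:20 indicate a 24-hour clock without zero padding. The sketch below combines the two fields into one timestamp under those assumptions, using only the standard library; `parse_crash_timestamp` is a hypothetical helper name, and the format string is inferred from the sample rather than documented by the report.

```python
from datetime import datetime


def parse_crash_timestamp(crash_date: str, crash_time: str) -> datetime:
    """Combine the two columns, assuming MM/DD/YYYY dates and 24-hour H:MM times."""
    return datetime.strptime(f"{crash_date} {crash_time}", "%m/%d/%Y %H:%M")


# (CRASH DATE, CRASH TIME) pairs copied from sample rows 0, 1 and 8.
samples = [("09/11/2021", "2:39"), ("03/26/2022", "11:45"), ("04/26/2023", "22:20")]
parsed = [parse_crash_timestamp(d, t) for d, t in samples]

for ts in parsed:
    print(ts.isoformat())
```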
Last rows
| | CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2169677 | 04/15/2025 | 15:52 | BRONX | 10461.0 | 40.854298 | -73.85492 | (40.854298, -73.85492) | NaN | NaN | 2007 WILLIAMSBRIDGE RD | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4805948 | Sedan | Sedan | NaN | NaN | NaN |
| 2169678 | 04/15/2025 | 20:00 | QUEENS | 11366.0 | 40.728012 | -73.78483 | (40.728012, -73.78483) | UNION TPKE | 184 ST | NaN | 2.0 | 0.0 | 0 | 0 | 0 | 0 | 2 | 0 | Traffic Control Disregarded | Unspecified | NaN | NaN | NaN | 4806383 | Station Wagon/Sport Utility Vehicle | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 2169679 | 04/15/2025 | 14:30 | MANHATTAN | 10036.0 | 40.757553 | -73.98551 | (40.757553, -73.98551) | NaN | NaN | 1516 BROADWAY | 1.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | Traffic Control Disregarded | NaN | NaN | NaN | NaN | 4806096 | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN | NaN |
| 2169680 | 04/15/2025 | 23:20 | QUEENS | 11691.0 | 40.610480 | -73.75028 | (40.61048, -73.75028) | NaN | NaN | 12-50 REDFERN AVE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | View Obstructed/Limited | NaN | NaN | NaN | NaN | 4806081 | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN | NaN |
| 2169681 | 04/07/2025 | 8:50 | BROOKLYN | 11221.0 | 40.695114 | -73.91186 | (40.695114, -73.91186) | PUTNAM AVE | KNICKERBOCKER AVE | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Backing Unsafely | Unspecified | NaN | NaN | NaN | 4806432 | Sedan | NaN | NaN | NaN | NaN |
| 2169682 | 04/15/2025 | 5:58 | NaN | NaN | 40.761272 | -73.95571 | (40.761272, -73.95571) | FDR DRIVE | NaN | NaN | 2.0 | 0.0 | 1 | 0 | 0 | 0 | 1 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4806221 | Station Wagon/Sport Utility Vehicle | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 2169683 | 04/14/2025 | 19:22 | STATEN ISLAND | 10304.0 | 40.601810 | -74.09283 | (40.60181, -74.09283) | RICHMOND RD | ROME AVE | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | NaN | NaN | NaN | NaN | 4806275 | NaN | NaN | NaN | NaN | NaN |
| 2169684 | 04/14/2025 | 21:25 | QUEENS | 11436.0 | 40.675716 | -73.79124 | (40.675716, -73.79124) | NaN | NaN | 147-06 123 AVE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Turning Improperly | Unspecified | NaN | NaN | NaN | 4806294 | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN | NaN |
| 2169685 | 04/15/2025 | 13:56 | MANHATTAN | 10000.0 | 0.000000 | 0.00000 | (0.0, 0.0) | NaN | NaN | 90-02 EAST DR | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Following Too Closely | Unspecified | NaN | NaN | NaN | 4806171 | Bike | Bike | NaN | NaN | NaN |
| 2169686 | 03/23/2025 | 13:00 | BRONX | 10462.0 | 40.836330 | -73.85505 | (40.83633, -73.85505) | NaN | NaN | 1502 OLMSTEAD AVE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4806253 | Sedan | NaN | NaN | NaN | NaN |